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TITLE OF THE INVENTION 
SYNTHETIC HEPATITIS C GENES 

CROSS-REFERENCE TO RELATED APPLICATIONS 
5 Not applicable. 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not applicable. 

1 0 REFERENCE TO MICROFICHE APPENDIX 
Not applicable. 

FIELD OF THE INVENTION 
Not applicable. 

15 

BACKGROUND OF THE INVENTION 

This invention relates to novel nucleic acid pharmaceutical 
products, specifically nucleic acid vaccine products. The nucleic acid 
vaccine products, when introduced directly into muscle cells, induce the 3 
20 production of immune responses which specifically recognize Hepatitis 
C virus (HCV). 

Hepatitis C Virus 

Non-A, Non-B hepatitis (NANBH) is a transmissible disease 

25 (or family of diseases) that is believed to be virally induced, and is 

distinguishable from other forms of virus-associated liver disease, such 
as those caused by hepatitis A virus (HAV), hepatitis B virus (HBV), 
delta hepatitis virus (HDV), cytomegalovirus (CMV) or Epstein-Barr 
virus (EBV). Epidemiologic evidence suggests that there may be three 

30 types of NANBH: the water-borne epidemic type; the blood or needle 
associated type; and the sporadically occurring (community acquired) 
type. However, the number of causative agents is unknown. Recently, a 
new viral species, hepatitis C virus (HCV) has been identified as the 
primary (if not only) cause of blood-associated NANBH (BB-NANBH). 
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Hepatitis C appears to be the major form of transfusion-associated 
hepatitis in a number of countries, including the United States and 
Japan. There is also evidence implicating HCV in induction of 
hepatocellular carcinoma. Thus, a need exists for an effective method 
5 for preventing or treating HCV infection: currently, there is none. 

The HCV may be distantly related to the flaviviridae. The 
Flavivirus family contains a large number of viruses which are small, 
enveloped pathogens of man. The morphology and composition of 
Flavivirus particles are known, and are discussed in M. A. Brinton, in 

10 "The Viruses: The Togaviridae And Flaviviridae" (Series eds. Fraenkel- 
Conrat and Wagner, vol. eds. Schlesinger and Schlesinger, Plenum 
Press, 1986), pp. 327-374. Generally, with respect to morphology, 
Flaviviruses contain a central nucleocapsid surrounded by a lipid 
bi layer. Virions are spherical and have a diameter of about 40-50 nm. 

15 Their cores are about 25-30 nm in diameter. Along the outer surface of 
the virion envelope are projections measuring about 5-10 nm in length 
with terminal knobs about 2 nm in diameter. Typical examples of the 
family include Yellow Fever virus, West Nile virus, and Dengue Fever 
virus. They possess positive-stranded RNA genomes (about 1 1 ,000 

20 nucleotides) that are slightly larger than that of HCV and encode a 
polyprotein precursor of about 3500 amino acids. Individual viral 
proteins are cleaved from this precursor polypeptide. 

The genome of HCV appears to be single-stranded RNA 
containing about 10,000 nucleotides. The genome is positive-stranded, 

25 and possesses a continuous translational open reading frame (ORF) that 
encodes a polyprotein of about 3,000 amino acids. In the ORF, the 
structural proteins appear to be encoded in approximately the first 
quarter of die N-terminal region, with the majority of the polyprotein 
attributed to non-structural proteins. When compared with all known 

30 viral sequences, small but significant co-linear homologies are observed 
with the nonstructural proteins of the Flavivirus family, and with the 
pestiviruses (which are now also considered to be part of the Flavivirus 
family). 
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Intramuscular inoculation of polynucleotide constructs, i.e., 
DNA plasmids encoding proteins have been shown to result in the in situ 
generation of the protein in muscle cells. By using cDNA plasmids 
encoding viral proteins, both antibody and CTL responses were 

5 generated, providing homologous and heterologous protection against 
subsequent challenge with either the homologous or cross-strain 
protection, respectively. Each of these types of immune responses 
offers a potential advantage over existing vaccination strategies. The 
use of PNVs (polynucleotide vaccines) to generate antibodies may result 

10 in an increased duration of the antibody responses as well as the 
provision of an antigen that can have both the exact sequence of the 
clinically circulating strain of virus as well as the proper post- 
translational modifications and conformation of the native protein (vs. a 
recombinant protein). The generation of CTL responses by this means 

15 offers the benefits of cross-strain protection without the use of a live 
potentially pathogenic vector or attenuated virus. 

Therefore, this invention contemplates methods for 
introducing nucleic acids into living tissue to induce expression of 
proteins. The invention provides a method for introducing viral 

20 proteins into the antigen processing pathway to generate virus-specific 
immune responses including, but not limited to, CTLs. Thus, the need 
for specific therapeutic agents capable of eliciting desired prophylactic 
immune responses against viral pathogens is met for HCV vims by this 
invention. Of particular importance in this therapeutic approach is the 

25 ability to induce T-cell immune responses which can prevent infections 
even of virus strains which are heterologous to the strain from which 
the antigen gene was obtained. Therefore, this invention provides DNA 
constructs encoding viral proteins of the hepatitis C virus core, envelope 
(El), nonstructural (NS5) genes or any other HCV genes which encode 

30 products which generate specific immune responses including but not 
limited to CTLs. 



WO 97/47358 



PCT/US97/09884 



.4. 

DNA Vaccines 

Benvenisty, N., and Reshef, L. f PNAS 83, 955 1 -9555, 
(1986)] showed that CaCl2-precipitated DNA introduced into mice 
intraperitoneally (i.p.), intravenously (i.v.) or intramuscularly (i.m.) 
5 could be expressed. The Lm. injection of DNA expression vectors 
without CaCl2 treatment in mice resulted in the uptake of DNA by the 
muscle cells and expression of the protein encoded by the DNA . The 
plasmids were maintained episomally and did not replicate. 
Subsequently, persistent expression has been observed after i.m. 

10 injection in skeletal muscle of rats, fish and primates, and cardiac 
muscle of rats. The technique of using nucleic acids as therapeutic 
agents was reported in WO90/1 1092 (4 October 1990), in which 
polynucleotides were used to vaccinate vertebrates. 

It is not necessary for the success of the method that 

15 immunization be intramuscular. The introduction of gold 

microprojectiles coated with DNA encoding bovine growth hormone 
(BGH) into the skin of mice resulted in production of anti-BGH 
antibodies in the mice. A j^t injector has been used to tiansfect skin, 
muscle, fat, and mammary tissues of living animals. Various methods 

20 for introducing nucleic acids have been reviewed. Intravenous injection 
of a DNAxationic liposome complex in mice was shown by Zhu et ah, 
[Science 261:209-21 1 (9 July 1993) to result in systemic expression of a 
cloned transgene. Ulmer et al., [Science 259: 1 745- 1 749, ( 1 993)] 
reported on the heterologous protection against influenza vims infection 

25 by intramuscular injection of DNA encoding influenza virus proteins. 

The need for specific therapeutic and prophylactic agents 
capable of eliciting desired immune responses against pathogens and 
tumor antigens is met by the instant invention. Of particular 
importance in this therapeutic approach is the ability to induce T-cell 

30 immune respoases which can prevent infections or disease caused even 
by virus strains which are heterologous to the strain from which the 
antigen gene was obtained. This is of particular concern when dealing 
with HIV as this virus has been recognized to mutate rapidly and many 
virulent isolates have been identified [see, for example, LaRosa et aL 
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Science 249:932-935 (1990), identifying 245 separate HIV isolates]. In 
response to this recognized diversity, researchers have attempted to 
generate CTLs based on peptide immunization. Thus, Takahashi et al., 
[Science 255:333-336 (1992)] reported on the induction of broadly 
5 cross-reactive cytotoxic T cells recognizing an HTV envelope (gp 1 60) 
determinant However, those workers recognized the difficulty in 
achieving a truly cross-reactive CTL response and suggested that there 
is a dichotomy between the priming or restimulation of T cells, which is 
very stringent, and the elicitation of effector function, including 
10 cytotoxicity, from already stimulated CTLs. 

Wang et al. reported on elicitation of immune responses in 
mice against HIV by intramuscular inoculation with a cloned, genomic 
(unspliced) HIV gene. However, the level of immune responses 
achieved in these studies was very low. In addition, the Wang et al., 
1 5 DN A construct utilized an essentially genomic piece of HTV encoding 
contiguous Tat/7?£V-gpl60-Tat/K£V coding sequences. As is described 
in detail below, this is a suboptimal system for obtaining high-level 
• • expression of the gpI-60.. It also !s.potentially.dange!»us.bccause 

expression of Tat contributes to the progression of Karposi's Sarcoma. 
20 WO 93/17706 describes a method for vaccinating an animal 

against a virus, wherein carrier particles were coated with a gene 
construct and the coated particles are accelerated into cells of an animal. 

The instant invention contemplates any of the known 
methods for introducing polynucleotides into living tissue to induce 
25 expression of proteins. However, this invention provides a novel 
immunogen for introducing proteins into the antigen processing 
pathway to efficiently generate specific CTLs and antibodies. 

Trvlnn I Isa pe and Cndon Context 
30 The codon pairings of organisms are highly nonrandom, 

and differ from organism to organism. This information is used to 
construct and express altered or synthetic genes having desired levels of 
translational efficiency, to determine which regions in a genome are 
protein coding regions, to introduce translational pause sites into 
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heterologous genes, and to ascertain relationship or ancestral origin of 
nucleotide sequences 

The expression of foreign heterologous genes in 
transformed organisms is now commonplace. A large number of 

5 mammalian genes, including, for example, murine and human genes, 
have been successfully inserted into single celled organisms. Standard 
techniques in this regard include introduction of the foreign gene to be 
expressed into a vector such as a plasmid or a phage and utilizing that 
vector to insert the gene into an organism. The native promoters for 

10 such genes are commonly replaced with strong promoters compatible 
with the host into which the gene is inserted. Protein sequencing 
machinery permits elucidation of the amino acid sequences of even 
minute quantities of native protein. From these amino acid sequences, 
DN A sequences coding for those proteins can be inferred. DN A 

1 5 synthesis is also a rapidly developing art, and synthetic genes 
corresponding to those inferred DNA sequences can be readily 
constructed. 

Despite the burgeoning know ledge of exprc^ion systemr> 
and recombinant DNA, significant obstacles remain when one attempts 

20 to express a foreign or synthetic gene in an organism. Many native, 
active proteins, for example, are glycosylated in a manner different 
from that which occurs when they are expressed in a foreign host. For 
this reason, eukaryotic hosts such as yeast may be preferred to bacterial 
hosts for expressing many mammalian genes. The glycosylation 

25 problem is the subject of continuing research. 

Another problem is more poorly understood. Often 
translation of a synthetic gene, even when coupled with a strong 
promoter, proceeds much less efficiently than would be expected. The 
same is frequently true of exogenous genes foreign to the expression 

30 organism. Even when the gene is transcribed in a sufficiently efficient 
manner that recoverable quantities of the translation product are 
produced, die protein is often inactive or otherwise different in 
properties from the native protein. 
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It is recognized that the latter problem is commonly due to 
differences in protein folding in various organisms. The solution to this 
problem has been elusive, and the mechanisms controlling protein 
folding are poorly understood. 
5 The problems related to translational efficiency are 

believed to be related to codon context effects. The protein coding 
regions of genes in all organisms are subject to a wide variety of 
functional constraints, some of which depend on the requirement for 
encoding a properly functioning protein, as well as appropriate 
10 translational start and stop signals. However, several features of protein 
coding regions have been discerned which are not readily understood in 
terms of these constraints. Two important classes of such features are 
those involving codon usage and codon context* 

It is known that codon utilization is highly biased and varies 
15 considerably between different organisms. Codon usage patterns have 
been shown to be related to the relative abundance of tRN A 
isoacceptors. Genes encoding proteins of high versus low abundance 
; show differences in their codon preferences. The- possibility that biases 
in codon usage alter peptide elongation rates has been widely discussed. 
20 While differences in codon use are associated with differences in 

translation rates, direct effects of codon choice on translation have been 
difficult to demonstrate. Other proposed constraints on codon usage 
patterns include maximizing the fidelity of translation and optimizing 
the kinetic efficiency of protein synthesis. 
25 Apart from the non-random use of codons, considerable 

evidence has accumulated that codon/anticodon recognition is influenced 
by sequences outside the codon itself, a phenomenon termed "codon 
context." There exists a strong influence of nearby nucleotides on the 
efficiency of suppression of nonsense codons as well as missense codons. 
30 Clearly, the abundance of suppressor activity in natural bacterial 
populations, as well as the use of "termination" codons to encode 
selenocysteine and phosphoserine require that termination be context- 
dependent. Similar context effects have been shown to influence the 
fidelity of translation, as well as the efficiency of translation initiation. 
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Statistical analyses of protein coding regions of E. coli have 
demonstrate another manifestation of "codon context." The presence of 
a particular codon at one position strongly influences the frequency of 
occurrence of certain nucleotides in neighboring codons, and these 
5 context constraints differ markedly for genes expressed at high versus 
low levels. Although the context effect has been recognized, the 
predictive value of the statistical rules relating to preferred nucleotides 
adjacent to codons is relatively low. This has limited the utility of such 
nucleotide preference data for selecting codons to effect desired levels 

10 of translational efficiency. 

The advent of automated nucleotide sequencing equipment 
has made available large quantities of sequence data for a wide variety 
of organisms. Understanding those data presents substantial difficulties. 
For example, it is important to identify the coding regions of the 

15 genome in order to relate the genetic sequence data to protein 

sequences. In addition, the ancestry of the genome of certain organisms 
is of substantial interest. It is known that genomes of some organisms 
are cf mixed ancestry. Some sequences that are viral iii origin are now 
stably incorporated into the genome of eukaryotic organisms. The viral 

20 sequences themselves may have originated in another substantially 
unrelated species. An understanding of the ancestry of a gene can be 
important in drawing proper analogies between related genes and their 
translation products in other organisms. 

There is a need for a better understanding of codon context 

25 effects on translation, and for a method for determining the appropriate 
codons for any desired translational effect. There is also a need for a 
method for identifying coding regions of the genome from nucleotide 
sequence data. There is also a need for a method for controlling protein 
folding and for insuring that a foreign gene will fold appropriately 

30 when expressed in a host. Genes altered or constructed in accordance 
with desired translational efficiencies would be of significant worth. 

Another aspect of the practice of recombinant DNA 
techniques for the expression by microorganisms of proteins of 
industrial and pharmaceutical interest is the phenomenon of "codon 
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preference". While it was earlier noted that the existing machinery for 
gene expression is genetically transformed host cells will "operate" to 
construct a given desired product, levels of expression attained in a 
microorganism can be subject to wide variation, depending in part on 
5 specific alternative forms of the amino acid-specifying genetic code 
present in an inserted exogenous gene. A "triplet" codon of four 
possible nucleotide bases can exist in 64 variant forms. That these 
forms provide the message for only 20 different amino acids (as well as 
transcription initiation and termination) means that some amino acids 

10 can be coded for by more than one codon. Indeed, some amino acids 
have as many as six "redundant", alternative codons while some others 
have a single, required codon. For reasons not completely understood, 
alternative codons are not at all uniformly present in the endogenous 
DNA of differing types of cells and there appears to exist a variable 

15 natural hierarchy or "preference" for certain codons in certain types of 
cells. 

As one example, the amino acid leucine is specified by any 

of six DNA cc^ons.includiRg-»CTA,.CTC, CTG, CTT, TTA, and TTG , - , ^ 

(which correspond, respectively, to the mRNA codons, CUA, CUC, 
20 CUG, CUU, UUA and UUG)* Exhaustive analysis of genome codon 

frequencies for microorganisms has revealed endogenous DNA of E. 

coli most commonly contains the CTG leucine-specifying codon, while 

the DNA of yeasts and slime molds most commonly includes a TTA 

leucine-specifying codon. In view of this hierarchy, it is generally held 
25 that the likelihood of obtaining high levels of expression of a leucine- 

rich polypeptide by an E. coli host will depend to some extent on the 

frequency of codon use. For example, a gene rich in TTA codons will 

in all probability be poorly expressed in E. coli. whereas a CTG rich 

gene will probably highly express the polypeptide. Similarly, when 
30 yeast cells are the projected transformation host cells for expression of a 

leucine-rich polypeptide, a preferred codon for use in an inserted DNA 

would be TTA. 

The implications of codon preference phenomena on 

recombinant DNA techniques are manifest, and the phenomenon may 
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serve to explain many prior failures to achieve high expression levels of 
exogenous genes in successfully transformed host organisms-a less 
"preferred" codon may be repeatedly present in the inserted gene and 
the host cell machinery for expression may not operate as efficiently. 
5 This phenomenon suggests that synthetic genes which have been 

designed to include a projected host cell's preferred codons provide a 
preferred form of foreign genetic material for practice of recombinant 
DNA techniques. 

10 Protein Trafficking 

The diversity of function that typifies eukaryotic cells 
depends upon the structural differentiation of their membrane 
boundaries. To generate and maintain these structures, proteins must be 
transported from their site of synthesis in the endoplasmic reticulum to 

15 predetermined destinations throughout the cell. This requires that the 
trafficking proteins display sorting signals that are recognized by the 
molecular machinery responsible for route selection located at the 
* . access points to the mairrtraffsckmg pathways. Sorting decisions for 
most proteins need to be made only once as they traverse their 

20 biosynthetic pathways since their final destination, the cellular location 
at which they perform their function, becomes their permanent 
residence. 

Maintenance of intracellular integrity depends in part on 
the selective sorting and accurate transport of proteins to their correct 
25 destinations. Over the past few years the dissection of die molecular 
machinery for targeting and localization of proteins has been studied 
vigorously. Defined sequence motifs have been identified on proteins 
which can act as 'address labels*. A number of sorting signals have been 
found associated with the cytoplasmic domains of membrane proteins. 

30 

SUMMARY OF THE INVENTION 

This invention relates to novel formulations of nucleic acid 
pharmaceutical products, specifically nucleic acid vaccine products. 
The nucleic acid products, when introduced directly into muscle cells, 
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induce the production of immune responses which specifically recognize 
Hepatitis C virus (HCV). 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 Figure 1 shows the nucleotide sequence of the VIRa vector. 

Figure 2 is a diagram of the VIRa vector. 
Figure 3 is a diagram of the Vtpa vector. 
Figure 4 is the VUb vector 

Figure 5 shows an optimized sequence of the HCV core 

10 antigen. 

Figure 6 shows VIRa.HCVlCorePAb, Vtpa.HCVlCorePAb 
and VUb.HCVlCorePAb. 

Figure 7 shows the Hepatitis C Virus Core Antigen 

Sequence. 

15 Figure 8 shows codon utilization in human protein-coding 

sequences (from Lathe et ah). 

Figure 9 shows an optimized sequence of the HCV El 

protein. : 

Figure 10 shows an optimized sequence of the HCV E2 

20 protein. 

Figure 1 1 shows an optimized sequence of the HCV El +E2 

proteins. 

Figure 12 shows an optimized sequence of the HCV NS5a 

protein. 

25 Figure 1 3 shows an optimized sequence of the HCV NS5b 

protein. 

DETAILED DESCRIPTION OF THE INVENTION 

This invention relates to novel formulations of nucleic acid 
30 pharmaceutical products, specifically nucleic acid vaccine products. 

The nucleic acid vaccine products, when introduced directly into muscle 
cells, induce the production of immune responses which specifically 
recognize Hepatitis C virus (HCV). 
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Non-A, Non-B hepatitis (NANBH) is a transmissible disease 
(or family of diseases) that is believed to be virally induced, and is 
distinguishable from other forms of virus-associated liver disease, such 
as those caused by hepatitis A virus (HAV), hepatitis B virus (HBV), 
5 delta hepatitis virus (HDV), cytomegalovirus (CMV) or Epstein-Barr 
virus (EBV). Epidemiologic evidence suggests that there may be three 
types of NANBH: the water-borne epidemic type; the blood or needle 
associated type; and the sporadically occurring (community acquired) 
type. However, the number of causative agents is unknown. Recently, a 

10 new viral species, hepatitis C virus (HCV) has been identified as the 
primary (if not only) cause of blood-associated NANBH (BB-NANBH). 
Hepatitis C appears to be the major form of transfusion-associated 
hepatitis in a number of countries, including the United States and 
Japan. There is also evidence implicating HCV in induction of 

15 hepatocellular carcinoma. Thus, a need exists for an effective method 
for preventing or treating HCV infection: currently, there is none. 

The HCV may be distantly related to the flaviviridae. The 
. Fiavivirus family contains a large number of viruses which are small, 
enveloped pathogens of man. The morphology and composition of 

20 Fiavivirus particles are known, and are discussed in M. A. Brinton, in 
"The Viruses: The Togaviridae And Flaviviridae" (Series eds. Fraenkel- 
Conrat and Wagner, vol. eds. Schlesinger and Schlesinger, Plenum 
Press, 1986), pp. 327-374. Generally, with respect to morphology, 
Flaviviruses contain a central nucleocapsid surrounded by a lipid 

25 bilayer. Virions are spherical and have a diameter of about 40-50 nm. 
Their cores are about 25-30 nm in diameter. Along the outer surface of 
the virion envelope are projections measuring about 5-10 nm in length 
with terminal knobs about 2 nm in diameter. Typical examples of the 
family include Yellow Fever virus, West Nile virus, and Dengue Fever 

30 virus. They possess positive-stranded RNA genomes (about 1 1 ,000 
nucleotides) that are slightly larger than that of HCV and encode a 
polyprotein precursor of about 3500 amino acids. Individual viral 
proteins are cleaved from this precursor polypeptide. 
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The genome of HCV appears to be single-stranded RNA 
containing about 10,000 nucleotides. The genome is positive-stranded, 
and possesses a continuous translational open reading frame (ORF) that 
encodes a polyprotein of about 3,000 amino acids. In the ORF, the 

5 structural proteins appear to be encoded in approximately the first 
quarter of the N-terminal region, with the majority of the polyprotein 
attributed to non-structural proteins. When compared with all known 
viral sequences, small but significant co-linear homologies are observed 
with the nonstructural proteins of the Flavivirus family, and with the 

10 pesti viruses (which are now also considered to be part of the Flavivirus 
family). 

Intramuscular inoculation of polynucleotide constructs, i.e., 

DNA plasmids encoding proteins have been shown to result in the 

generation of the encoded protein in situ in muscle cells. By using 
15 cDNA plasmids encoding viral proteins, both antibody and CTL 

responses were generated, providing homologous and heterologous 

protection against subsequent challenge with either the homologous or 

crosr-strain protection, respectively. Each of these types -of immune v : - ; 

respoases offers a potential advantage over existing vaccination 
20 strategies. The use of PNVs (polynucleotide vaccines) to generate 

antibodies may result in an increased duration of the antibody responses 

as well as the provision of an antigen that can have both the exact 

sequence of the clinically circulating strain of virus as well as the 

proper post-translational modifications and conformation of the native 
25 protein (vs. a recombinant protein). The generation of CTL responses 

by this means offers the benefits of cross-strain protection without the 

use of a live potentially pathogenic vector or attenuated virus. 

The standard techniques of molecular biology for 

preparing and purifying DNA constructs enable the preparation of the 
30 DNA therapeutics of this invention. While standard techniques of 

molecular biology are therefore sufficient for the production of the 

products of this invention, the specific constructs disclosed herein 

provide novel therapeutics which surprisingly produce cross-strain 
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protection, a result heretofore unattainable with standard inactivated 
whole virus or subunit protein vaccines. 

The amount of expressible DNA to be introduced to a 
vaccine recipient will depend on the strength of the transcriptional and 
5 translational promoters used in the DNA construct, and on the 
immunogenicity of the expressed gene product. In general, an 
immunologically or prophylactically effective dose of about 1 |ig to 1 
mg, and preferably about 10 |ig to 300 \ig is administered directly into 
muscle tissue. Subcutaneous injection, intradermal introduction, 
10 impression through the skin, and other modes of administration such as 
intraperitoneal, intravenous, or inhalation delivery are also 
contemplated. It is also contemplated that booster vaccinations are to be 
provided. 

The DNA may be naked, that is, unassociated with any 

15 proteins, adjuvants or other agents which impact on the recipients 
immune system. In this case, it is desirable for the DNA to be in a 
physiologically acceptable solution, such as, but not limited to, sterile 
saline or sterile buffered saline. Alternatively, the DNA may be 
associated with surfactants, liposomes, such as lecithin liposomes or 

20 other liposomes known in the art, as a DNA-liposome mixture, (see for 
example WO93/24640) or the DNA may be associated with an adjuvant 
known in the art to boost immune responses, such as a protein or other 
carrier. Agents which assist in the cellular uptake of DNA, such as, but 
not limited to, calcium ions, detergents, viral proteins and other 

25 transfection facilitating agents may also be used to advantage. These 
agents are generally referred to as transfection facilitating agents and as 
pharmaceutically acceptable carriers. As used herein, the term gene 
refers to a segment of nucleic acid which encodes a discrete polypeptide. 
The term pharmaceutical, and vaccine are used interchangeably to 

30 indicate compositions useful for inducing immune responses. The terms 
construct, and plasmid are used interchangeably. The term vector is 
used to indicate a DNA into which genes may be cloned for use 
according to the method of this invention. 



WO 97/47358 



PCT/US97/098S4 



- 15 - 



The following examples are provided to further define the 
invention, without limiting the invention to the specifics of the 
examples. 



5 FX AMPLE 1 

Vll REPRESSION VECTORS: 

VI J is derived from vectors VI and pUC18, a 
commercially available plasmid. VI was digested with Sspl and EcoRI 
restriction enzymes producing two fragments of DN A. The smaller of 
10 these fragments, containing the CMVintA promoter and Bovine Growth 
Hormone (BGH) transcription termination elements which control the 
expression of heterologous genes, was purified from an agarose 
electrophoresis gel. The ends of this DNA fragment were then 
"blunted" using the T4 DNA polymerase enzyme in order to facilitate 
15 its ligation to another "blunt-ended" DNA fragment. 

pUC18 was chosen to provide the "backbone" of the 
expression vector. It is known to produce high yields of plasmid, is 
well-characterized by sequence and function, and is cf minimum size. 
We removed the entire lac operon from this vector, which was 
20 unnecessary for our purposes and may be detrimental to plasmid yields 
and heterologous gene expression, by partial digestion with the Haell 
restriction enzyme. The remaining plasmid was purified from an 
agarose electrophoresis gel, blunt-ended with the T4 DNA polymerase , 
treated with calf intestinal alkaline phosphatase, and ligated to the 
25 CMVintA/BGH element described above. Plasmids exhibiting either of 
two possible orientations of the promoter elements within the pUC 
backbone were obtained. One of these plasmids gave much higher 
yields of DNA in E. coli and was designated V 1 J. This vector's 
structure was verified by sequence analysis of the junction regions and 
30 was subsequently demonstrated to give comparable or higher expression 
of heterologous genes compared with VI. The ampicillin resistance 
marker was replaced with the neomycin resistance marker to yield 
vector VUneo. 
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An Sfi I site was added to VUneo to facilitate integration 
studies. A commercially available 13 base pair Sfi I linker (New 
England BioLabs) was added at the Kpn I site within the BGH sequence 
of the vector. VI Jneo was linearized with Kpn I, gel purified, blunted 
5 by T4 DNA polymerase, and ligated to the blunt Sfi I linker. Clonal 
isolates were chosen by restriction mapping and verified by sequencing 
through the linker. The new vector was designated VI Jns. Expression 
of heterologous genes in VI Jns (with Sfi I) was comparable to 
expression of the same genes in VI Jneo (with Kpn I). 

10 Vector VIRa (Sequence is shown in Figure 1; map is shown 

in Figure 2) was derived from vector V1R, a derivative of the VI Jns 
vector. Multiple cloning sites (fl#/Il, Kpnl, EcoRV, EcoRl, Sail, and 
NotV) were introduced into V1R to create the VIRa vector to improve 
the convenience of subcloning. VIRa vector derivatives containing the 

15 tpa leader sequence and ubiquitin sequence were generated (Vtpa 
(Figure 3) and Vub (Figure 4), respectively). Expression of viral 
antigen from Vtpa vector will target the antigen protein into the 
exocyiic pathway, thus producing a secre&ble form of the antigen 
proteins. These secreted proteins are likely to be captured by 

20 professional antigen presenting cells, such as macrophages and dendritic 
cells, and processed and presented by class II molecules to activate CD4+ 
Th cells. They also are more likely to efficiently simulate antibody 
responses. Expression of viral antigen through VUb vector will 
produce a ubiquitin and antigen fusion protein. The uncleavable 

25 ubiquitin segment (glycine to alanine change at the cleavage site, Butt et 
al., JBC 263:16364, 1988) will target the viral antigen to ubiquitin- 
associated proteasomes for rapid degradation. The resulting peptide 
fragments will be transported into the ER for antigen presentation by 
class I molecules. This modification is attempted to enhance the class I 

30 molecule-restricted CTL responses against the viral antigen (Townsend 
etal, JEM 168:1211, 1988). 
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EXAMPLE 2 

nESlfiN AND CONSTRUCTION OF THE SYNTHE TIC GENES 

A Dftsipn of Synthetic Gene Segments for H CV Gene Expression: 
5 Gene segments were converted to sequences having 

identical translated sequences (except where noted) but with alternative 
codon usage as defined by R. Lathe in a research article from J. Molec. 
Biol. Vol. 183, pp. 1-12 (1985) entitled "Synthetic Oligonucleotide 
Probes Deduced from Amino Acid Sequence Data: Theoretical and 
10 Practical Considerations". The methodology described below was based 
on our hypothesis that the known inability to express a gene efficiently 
in mammalian cells is a consequence of the overall transcript 
composition. Thus, using alternative codons encoding the same protein 
sequence may remove the constraints on HCV gene expression. 
1 5 Inspection of the codon usage within HCV genome revealed that a high 
percentage of codons were among those infrequently used by highly 
expressed human genes. The specific codon replacement method 
employed may be described as follows employing data from Lathe et 
al.: 

20 1 • Identify placement of codons for proper open 

reading frame. 

2. Compare wild type codon for observed frequency of 

use by human genes (refer to Table 3 in Lathe et al.). 

3 . If codon is not the most commonly employed, 

25 replace it with an optimal codon for high expression based on data in 
Table 5. 

4. Inspect the third nucleotide of the new codon and the 
first nucleotide of the adjacent codon immediately 3'- of the first. If a 
5'-CG-3' pairing has been created by the new codon selection, replace it 

30 with the choice indicated in Table 5. 

5. Repeat this procedure until the entire gene segment 

has been replaced. 

6. Inspect new gene sequence for undesired sequences 
generated by these codon replacements (e.g., "A l l l A" sequences, 
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inadvertent creation of intron splice recognition sites, unwanted 
restriction enzyme sites, etc) and substitute codons that eliminate these 
sequences. 

7. Assemble synthetic gene segments and test for 
5 improved expression. 

B. HCV CORE ANTIGEN SEQUENCE 

The consensus core sequence of HCV was adopted from a 
generalized core sequence reported by Bukh et al. (PNAS, 91:8239, 
10 1994). This core sequence contains all the identified CTL epitopes in 
both human and mouse. The gene is composed of 573 nucleotides and 
encodes 191 amino acids. The predicted molecular weight is about 23 
kDa. 

The codon replacement was conducted to eliminate codons 
15 which may hinder the expression of the HCV core protein in transfected 
mammalian cells in order to maximize the translational efficiency of 
DNA vaccine. Twenty three point two percent (23.2%) of nucleotide 
, r ;equenee ( 1 33 put of 573 nucleotides) were altered, resulting in changes 
of 61 3% of the codons (1 17 out 191 codons) in the core antigen 
20 sequence. The optimized nucleotide sequence of HCV core is shown in 
Figure 5. 

C. CONSTRUCTION OF THE SYNTHETIC CORE GENE 

The optimized HCV core gene (Figure 5) was constructed 
25 as a synthetic gene annealed from multiple synthetic oligonucleotides. 
To facilitate the identification and evaluation of the synthetic gene 
expression in cell culture and its immunogenicity in mice, a CTL 
epitope derived from influenza virus nucleoprotein residues 366-374 
and an antibody epitope sequence derived from SV40 T antigen residues 
30 684-698 were tagged to the caiboxyl terminal of the core sequence 
(Figure 6). For clinical use it may be desired to express the core 
sequence without the nucleoprotein 366-374 and S V40 T 684-698 
sequences. For this reason, the sequence of the two epitopes is flanked 
by two EcoRl sites which will be used to excise this fragment of 



WO 97/47358 



PCT/US97/09884 



19 



sequence at a later time. Thus an embodiment of the invention for 

clinical use could consist of the VlRa.HCVlCorePAb, 

Vtpa.HCVlCorePAb, or VUb.HCVlCorePAb plasmids that had been 

cut with EcoRI, annealed, and ligated to yield plasmids 
5 V 1 Ra.HCV 1 Core, Vtpa.HCV 1 Core, and VUb.HCV 1 Core. 

The synthetic gene was built as three separate segments in 

three vectors, nucleotides 1 to 80 in VIRa, nucleotides 80 to 347 {BstXl 

site) in pUCl 8, and nucleotides 347 to 573 plus the two epitope 

sequence in pUC18. All the segments were verified by DNA 
1 0 sequencing, and joined together in V 1 Ra vector. 

D. HCV Gene Expression Coastmcts: 

In each case, the junction sequences from the 5' promoter 
region (CMVintA) into the cloned gene is shown. The position at which 
15 the junction occurs is demarcated by a "/", which does not represent any 
discontinuity in the sequence. 

•.- The nomenclature for mese constrwcts fpHcws the . . '"T- •■- " ; 

convention: "Vector name-HCV strain-gene". 

20 

VlRa.HCVl.CorePAb 

—IntA-AGA TCT ACC / ATG AGC-HCV.Core.-GCC / GAA TTC GCT TCC- 
PAb Sequence-TAA / ACC CGG GAA TTC TAA A / GTC GAC--BGH — 

25 Vtpa.HCVl.CorePAb 

— IntA-ATC ACC / ATG GAT-tpa leader-GAG ATC-TTC / ATG AGC- 
HCV.Core.~GCC / GAA TTC GCT TCC-PAb Sequence-TAA / ACC CGG GAA 
TTC TAA A /GTC GAC-BGH— 

30 VUb.HCV l.CorePAb. 

—IntA-AGA TCC ACC / ATG CAG-Ubiquitin-GGT GCA GAT CTG/ ATG AGC- 
HCV.Core.-GCC / GAA TTC GCT TCC-PAb Sequence-TAA / ACC CGG GAA 
TTC TAA A / GTC GAC-BGH— 
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VlRa.HCVl.Corc 
— IntA— AGA TCT ACC / ATG AGC-HCV.Corc.-GCC / TAA A / GTC GAC-- 
BGH— 

5 Vtpa.HCVl.Core 

— IntA-ATC ACC / ATG GAT-tpa leader~GAG ATC-TTC / ATG AGC-- 
HCV.Core.-GCC / TAA A / GTC GAC--BGH — 

VUb.HC V 1 .Core 

10 — IntA-AGA TCC ACC / ATG CAG-Ubiquitin-GGT GCA GAT CTG/ ATG AGC- 
HCV.Corc.-GCC / TAA A / GTC GAC-BGH-- 

E. OTHER SYNTHETIC HCV GENES 

Using similar codon optimization techniques, synthetic 
15 genes encoding the HCV El (Figure 9), HCV E2 (Figure 10), HCV 

E1+E2 (Figure 11), HCV NS5a (Figure 12) and HCV NS5b (Figure 13) 
proteins were created. 
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WHAT IS CLAIMED: 



1 . A synthetic polynucleotide comprising a DN A 
sequence encoding an HCV protein selected from the group consisting 

5 of HCV core protein, HCV El protein, HCV E1+E2 protein, HCV NS5a 
protein, HCV NS5b protein and fragments thereof, the DNA sequence 
comprising codons optimized for expression in a vertebrate host. 

2. A plasmid vector comprising the polynucleotide of 
10 Claim 1, the plasmid vector being suitable for immunization of a 

vertebrate host. 



15 
20 
25 



3. The polynucleotide of Claim I which is HCV 
genotype I/Ia core. 



241 WCCCTWX 
41H *n3>>>:TACAT 

4*1 oaaTn^AAiT 



AcCOcAAfliV 

AitxtArriV: 



The polynucleotide of Claim 1 having the sequence 

cCAriftViAAo AiXAAi |dt AOAO'AACaG «>*>jfVt:rAi : " " 



•ft/TACcTCcT iXtV^lAttT 
A»5W>|CAi5: OcATOTCAA 
CAAHJAoOJC TTti3GcTt5a; 
i^dWiV^ iiAAvcTOili ■* 
i » ftp X? Oft C Vr&fc&Sffft 



•5>:crf>Mi« CCtiSAO? a.V 

aaotr vvrt r, Mi.Ainnr'ft tv: 

rpVtA{V»Vr CTWCtCATO 



»^At>rrttAAfrr -nrvtiaana; hu 
ipxt AAiywvivti: imj 

(.a-IWHaar* cCA*Tt*H>&: 

cvstfixrti f*:Ttvk«:i:T\iA 4w 
tsji7iviAf»5/r rpm>yu^:yvi 4&f> 
c*nxiv-TO"T i^rniAdwrr sto 

<-7* 



30 
35 
40 
45 
50 
55 



i 

si 

161 
241 

J21 

<ni 

481 

'.M 
Ml 
721 
&01 
n»l 
*»G1 
1*V41 

nit 
12*1 

1 *M 
1441 
l r .21 
1W>1 

1'.h; 

176! 
Ih41 
:i21 
JO01 
1M1H1 
2lf.1 

J>J1 
^401 



5. The plasmid vector of Claim 2 having the sequence 

K. — ATAATAT5TA OATTTATATT i>3CTCA*n3TC CAAi.'ATTW V 



JATATPaa.T ATMAVATT tJL'ATA OTm* TAT^XATATi 
i S VATCTH 3A 0-ATTV.ATTAT TCACTAfiTTA TWATW2TAA 
•nXIXTCTTAT ATAACTTAirO •TtAAATIiX'C CWCWTC 

TA*nm>xTA TtosiMow aataoaiact TO?ATn^i: 

AfTTACATCAA 'JUTTATVATA T*X»?AAl3TAr iWTTATT 
AUTACA*fl;AC CTTAHJATAC TTTXTACTT (3>3CA*yrACAT 
TWA* TATA TCAATO*3CO H3lJATftlX»>3 TTT»5A<^T>:A»- 

TTPmrn*; cmxmmwt AAoaa^.TT t*.v aaaat v:t 
imTTAorn^ oyonvTAT ATAAta/Aiyw: fTtirmwrr 

GACCPXATA <?AM;ArAOX: tSJACrTAUX- A*3fXTW»3 

lyurroAorm AirmnwcT ata*jaitjvta TM»rccAcr 
onvTATArA oxwottc* invATrrrAT Aonwnarr 
itactcccct ATnamiAn:: ATAcrrmrA mrmA^i' 
totaataca cnjrcimvA imjacpsaca o racpmyr 

Ai'ATATACAA CAiYAOOTP: OXAiTIWCV «X*AiJTnTTA 

Trrm^-aw ai^w^tt mvtartAra;- ■aa^a.wr-Tr 
•Trnx-maa: AiXTorrox tv\taai;ai3t <>;A«aa:rA*^A 



laxiarcway; 



AfAAOVVTr 
AAiaa.TAlSOV. 
tTnAAi^TP; 

A»:Aiyvi"i>nT tXTrm*A'n: 



■mnjpjrnv; aaaa*p;a*3*t 

tJAMXX'AiTn* T5WmmSA*a.* AiJTAirmTTT 

umrrrnvT o^wstcaws 



iiM* IAj.'TAAG AOiiAACAiXA Ai'A03A*53>' CTAJJiATCP:: 

•nxTt a/o;*; A*aa-iva.a>a;; jxtac 

CAiXWATO* tXAAOVCA*.: (VW5>:tT»JA*3 *X'»^^^T*"»"T 

Ai'sxTrn'av ■naaamiAT warn^^nw nnx'Ajssw 
»VAi5JyWi"T t>">arAA*57P*: atti^a*^- •n'Ai^TPTns.: 
i^rnm>a^Atl i^?Tlt>aTAt; (ja-nHWT t;A*n*TS3TCA 

(a.*i*T»?'aT»: TvnTimvA t> tt^^.t ia.a.vcp.arrc 

t. wm| u i.*»m»' Aiavaa.*i.i;n; ATTnrrnv: ormrTAirrT 
tyn">rima5AA hticc<:ait»* i.va».t»w>t ttxtaataa 



t"L> 

TfAATTAOX; OGTTATTAUT 7VATA*5»ri?CA TATAI*?^ >T 1 f<0 
Al^aXTAAC OACO^lTt^ rAmSAOmC AATXAHlAi^"! 240 

iTrrAAHMTr n3H»3wrm corrAAAcns oxAcm^x J2n 

TATiTPrAA'PS Ai^STTAAA'P* iWOXCIW OATTA'P'Jr'^r 40l» 

•ttaottatta imrA , n > t5rra TTAo.*A*n>TT rAim^-JTrrr 4ho 
»>5>3ATTrr^: AA*rn*n.-t*A*; ox-atosao; "n/AAHJa w: sr»n 

OiTfAACAACT OS.SCWATT t;ACnCAAATC ilOl-^irnMaa' 040 

*yuM;iirP"Ai3 A^s^rpvj^ i^AnavAi*^: A«>?r!»TrnT 7211 
tx-ojajAAi^; •mx^An^iA ArarajAnv iwt.wAA aou 

0. TrntaanT inTATO/AH^ rrATAf-nrrr Tn^^TPJi a«u 

ATAi JTTAi^- i"TATA»>rPrr «aaTTIATTlSA »XATTAT05A *»W*. 

ATAA*>na>- Tn-mnTrAr AA rifi vrrr ATn>^nAiA hmu 
ATrnTACA»j •jA-n>a>Ti»rr «:atttattat tpa»:aaatt>^ n^u 

TTAAACATAA iXrpajTJVPT iXAOXXMtT «Tl.iaamM. , »; I2»lt 

CTAi'A'n.i^TA o-ivn-a-n^t- rAn^rrrrA o^ta: w ?.*A'p: 12»i: 

1. TTA<JJLAl.A *>.'AOiAT>.a.V CMVACCMV A)rprPVO>* MM' 
t.i.a.»JiM ATa : ».a.TPA*AO.r: «."I*iAt.i3;ATT *P.a *AAi *A:rrr 144U 
•pSTTCTViATA Ai^A».n>.'A*JiA*i i?BAAWi Tr».a.t a r^XT IVJti 

urnA^a^a* laiai'AiXA*; acataataiv t>^a> aivu*t?s iun: 
•p^rrrAi^tr r.«-»:Aiiyyx Ai.VAAO.rcA A»>r»"»VA»yv; H.vt 
AAmTo^. •a>a^a>t*A 'JATlvnvaa^A owrni^v I7*u;i 
t.'AtX^AAtAt.i: T.-n^»yu»:rr oy-AiavvAi; '»aa'Ai>w>: iM4i> 
■ >»?i:rAar h33CTa«>w mniTr-iur atua'aah.^ i^-h 
■p>'A«aaiiT i;*ernaaaai"i* «-ai;a»ja».i^X* A*ay^;a^^rr ~idm> 
iTrmATiyM" t^n3A*n»*>T ArATO.vrrr (ama>Tia>T 2o«u 
ta*a7l»>~pa5A la^-paaarn; AAJTATaTA i"n?>:AA'r>T 2iuu 
vrv?xnsA CAi^na^Ta; tp.-pjivmh ttcgcttcca ^^4n 
n;itrit?jt:rrc rtCi:ttp;trr^ .tt>t.utt .».vu- »:rip,p.ia*itrt: 2^2i- 

ia.vAiA>:*A'n' T»:rn»7Tn».a: txrTiyx^'^yi •PjOitt.ttp J4or 

AAT» JW? AAA Tn'J*AT>ia;A TPrTf^niWrT A»rrrT::ATT J4sn 
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10 



15 



20 



24B1 
2*.61 
2M1 
27J1 
2K01 
Z*H1 
2'>61 
1041 
♦121 
1201 

*ifcl 
t441 

tWl 

«an 

if. HI 
*7M 
i*41 
r»21 
4001 
4U«1 

4it»i 

4J41 



(7TATTtm> H 

■n.'iAii.iirm 

tto/» yw^riT 
A*>rnvwp*A 
rrTATO.iXTr 

Trn.sjwn.T* 
itarrAtxsxrr 

AAACHXTAAT 

■XJAlXXTWTIT 

toxoctott 
tvatio7iv;a 

tYaai.XAOSA 
OXtMSSATT 

mnvAtxxA 

TAAAT*7VX*A 
TATTAi-PTPr 
Ti^^/ATAA 



oxmxixnu 
trrA*xwn>' 
AAAAusai.ia* 

•H^Ti^lAAAO" 
iXXt^CTTAiV 

AACTATOTir 
iJAUTTAHTTA 

urrrrnrn; 
<xt7fAA*Pxr 

TTATTCATAT 
•XATAOATC 
CAAAAATA*; 
TDXA*SAiTT 

tpsoxxtvsa 
acacuxvw 
t yAim aynsA 

ijPTTAi3T»r*n3 
ivrr^XATO 

TWTT AAiX'A 

c*:m jxthv 



iXM:*X"A*.'A 

Aunxmr* 
im>x-iW"Xi 
ojacaosact 
la^TAcrnsr 

msAtrnxAA 

iJXtWTTA 
iJJUWVAtTIT 
TTOX'AAtiCA 

ciWAurorr 

OA* iSATTATV 
•VAATJnrrT 

» tttaivaai it 
iTnvAAi'Ai?; 

OISTAIVAACA 

irnutfLt.'AHA- 

AiXATVTVAT 
rAATrrJtt*: 

AATTTAAT»i: 
i.^:At3TTTTA 



tX"AA03£XiA 
ATTAAO XXO 
TTTnrCATA 
AUkAAiSATAO 

*xi>xnTt:T 

TtXAAlXTW 
OXiXTTAAiSA 

•wywrirTT 

AlVTnnxykA 
CTAfCTlftrc 
TAf'AMXAAT 
AATATVATAT 

iSTPATn-WTV 
iVWiAAATOAt.' 
iXAiXVATTA 

ATOTTTT*~A»r 
AIY'AICAttM 

ATT»rn»At: 
ixoxnv,*: 
TPrrTrA"niA 



iriAm.MiAA 
i:Atxwv*?r 
rjXTVX^v 

eAtx»xrrm: 
«xtdiw;a 

O^TPTD^'A 
CAOSACTFAT. 

UAAunxmx; 

AAA*3A0TT» S> 
tlJCAHAAAAA 
TAAO TAATTV 
TTTOJAAAAA 

T»X^OTPX\: 

osctoticat 

tX-pTTTAAAA 
CHSAAICAO^ 
tTTAOXATAA 
ATPXa.'AAii: 

ctcatiwx 

CAA*SA*^jTTT 
T*3ATATATTT 



fiWAATAOCA 
AOXAOTPX 

OXC1»>JAA«: 

Aiavnr» mv 

rt ;AArXi 

OXXAOIOSr 

iXTAACTAO* 

t*x"tctii3a 
aaosatttca 

"PiATTfciAAA 

tXXtSTTIVTC 

A*Tonvi:AA 

CAAAATCACT 

i>yU.'AATTAO 
ATOTP "n»T 
AATXTn^T 
'."EWITTTO 3 ■ 

rAi'ATTA'nt: 

O^TmJVAT 
TTATTPTP: 



*Xs;A*nX"PX: 
T*jAAi3AATTG 
CATCACAAAA 

rnx-'rroTn; 

TTTt.**ICAA*n: 

AOrAfXXACT 
tXTACACTA^ 

txwaaat 
AtaAwmT 

AATTCWrcUA 
^ATCAATATA 

(xmraAAi*; 

WATVAAiv 
AAAl -Al » MAT 
AATWmSiA 

i.'ATinTP.'A'^ 

AT* > JCTCATA 
fAAHTTAATA 



ixwiwxro 1 : 
ATVAiTa-n; 
orc-Aoa-nrr 

ACOXHX**** 

•>rrAA*7At >.yv 

AAtVJVTAi.TA 
AAAiVAiTVl' 
T03AT>.TnT 
«J» , ATOAAA*flj 
iCAAAAiTVAO 
A'^TATTAAT 
■•X'AAAAiXTT 
AAACOJTTAT 
'.-i^AAT WAAr 
AT»>rnj"rrrr 
'^XVlTAAATT 
AAACAATTr-r 
TATAO.^A'm 
A'*At.*cl.-!-TP: 
I^AtiAl 1ATTT 



-'Ml' 
272U 
£MMi 

1 1 1 4 1 * 

> M.ii 
*S2» 

JtUM 

\"*t.Q 

4'», r t. 
4t>Ua 
41 Mi 

4^* 1 



25 6. The polynucleotide of Claim 4 from which the PAb 

sequence has been removed. 

7. The plasmid vector of Claim 5 from which the PAb 
sequence has been removed. 

30 

8. A method for inducing immune responses in a 
vertebrate against HCV epitopes which comprises introducing between I 
ng and 100 mg of the polynucleotide of Claim 1 into the tissue of the 
vertebrate. 

35 

9. A method for inducing immune responses against 
infection or disease caused by HCV which comprises introducing into 
the tissue of a vertebrate the polynucleotide of Claim 1. 

40 10. A vaccine for inducing immune responses against 

HCV infection which comprises the polynucleotide of Claim I and a 
pharmaceutical^ acceptable carrier. 

11. A method for inducing anti-HCV immune responses 
45 in a primate which comprises introducing the polynucleotide of Claim 1 
into the tissue of said primate and concurrently administering 
interleukin-12 parenterally. 
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12. A method of inducing an antigen presenting cell to 
stimulate cytotoxic and helper T-cell proliferation an effector functions 
including lymphokine secretion specific to HCV antigens which 

5 comprises exposing cells of a vertebrate in vivo to the polynucleotide of 
Claim 1. 

13. A method of treating a patient in need of such 
treatment comprising administering to the patient the polynucleotide of 

10 Claim 1 in combination with interferon-alpha, Ribavirin, Zidovudine, 
or other pharmaceutical ly acceptable antiviral agents.. 

1 4. A pharmaceutical composition comprising the 
polynucleotide of Claim 1 . 

15 

15. A method of inducing an immune response 
comprising administering the polynucleotide of Claim 1 to a patient, the 
administration of ^ or coinciding or 
following administration to the patient of a subunit, recombinant, 

20 recombinant live vector, inactivated, recombinant inactivated vector, or 
live attenuated HCV vaccine. 

16. A method for inducing immune responses in a 
vertebrate against HCV epitopes which comprises introducing between 1 

25 ng and 100 mg of the polynucleotide of Claim 2 into the tissue of the 
vertebrate. 

17. A method for inducing immune responses against 
infection or disease caused by HCV which comprises introducing into 

30 the tissue of a vertebrate the polynucleotide of Claim 2. 

18. A vaccine for inducing immune responses against 
HCV infection which comprises the polynucleotide of Claim 2 and a 
pharmaceutical^ acceptable carrier. 
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19. A method for inducing anti-HCV immune responses 
in a primate which comprises introducing the polynucleotide of Claim 2 
into the tissue of said primate and concurrently administering 

5 interleukin 12 parenterally. 

20. A method of inducing an antigen presenting cell to 
stimulate cytotoxic and helper T-cell proliferation an effector functions 
including lymphokine secretion specific to HCV antigens which 

10 comprises exposing cells of a vertebrate in vivo to the polynucleotide of 
Claim 2. 

21 . A method of treating a patient in need of such 
treatment comprising administering to the patient the polynucleotide of 

15 Claim 2 in combination with interferon-alpha, Ribavirin, Zidovudine, 
or other pharmaceutically acceptable antiviral agents.. 

. 22. A pharmaceutical composition comprising the 
polynucleotide of Claim 2. 

20 

23. A method of inducing an immune response 
comprising administering the polynucleotide of Claim 2 to a patient, the 
administration of the polynucleotide antedating or coinciding or 
following administration to the patient of a subunit, recombinant, 

25 recombinant live vector, inactivated, recombinant inactivated vector, or 
live attenuated HCV vaccine. 

24. The vector of Claim 2 which is selected from 
VlRa.HCVlCorePAb, Vtpa.HCVlCorePAb, VUb.HCVlCorePAb, 

30 V 1 Ra.HCV 1 Core, Vtpa.HCV 1 Core and VUb.HCV 1 Core. 



25. A pharmaceutical composition comprising the vector 

of Claim 21. 
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26. The DNA sequence of Claim 1 selected from the 
group consisting of a nucleotide sequence shown in Figure 5, Figure 9, 
Figure 10, Figure 11, Figure 12 and Figure 13. 
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FIG.4 
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TABLE 3 

CODON UTILIZATION IN HUMAN PROTEIN-COOING SEQUENCES 



f 
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42 


0.09 








CUU 


50 


0.11 








cue 


99 


0.22 








CUA 


30 


0.07 








cue 


204 


0.46 






1 


AUU 


28 


0.23 


123 


2.9 




AUC 


79 


0.64 








AUA 


16 


0.13 






U 


AUC 


77 


1X0 


77 


1.8 


V 


GUU 


35 


0.13 


266 


6.2 




GUC 


72 


0.27 








bUA 












GUG 


134 


0.50 






s 


UCU 


59 


0.17 


349 


8.1 




UCC 


91 


0.26 










XI 
«!/ 


U. 1 1 








UCG 


25 


0.07 








ACU 


37 


0.11 








ACC 


100 


0.29 






p 


ecu 


51 


0.24 


212 


A3 




CCC 


86 


0.41 








CCA 


51 


0.24 








CCG 


24 


0.11 






T 


ACU 


47 


0.20 


238 


5.6 




ACC 


113 


0.47 








ACA 


50 


0.21 








ACC 


28 


0.12 






A 


ecu 


91 


0JI 


298 


7.0 




GCC 


119 


0.40 








GCA 


51 


0.17 








GCG 


37 


0.12 







Y 


UAU 


72 


0.47 


153 


3.6 




UAC 


81 


0.53 






H 


CAU 


44 


0.42 


105 


2.5 




CAC 


61 


058 






Q 


CAA 


50 


0.26 


192 


4.5 




CAG 


142 


0.74 






N 


AAU 


51 


0.34 


148 


3.5 




AAT 


97 


0.66 






K 


AAA 


137 


0.45 


303 


7.0 




AAG 


166 


0.55 






D 


GAU 


79 


0J8 


209 


A A 

4.9 




GAC 


130 


0.62 






F 

L. 


GAA 


125 


0.40 


311 


7J 




GAG 


186 


0.60 






C 


UGU 


44 


0J0 


147 


3.4 




UGC 


103 


0.70 






w 


UGG 


56 


1.00 


56 


U 


R 


CGV 


19 


0.09 


215 


5.0 




CGC 


40 


0.19 








CCA 


22 


0.10 








CGG 


33 


0.15 








AGA 


51 


0.24 








AGG 


50 


0.23 






G 


GGU 


36 


0.15 


245 


5.7 




GGC 


108 


0.44 








GGA 


42 


0.17 








GGG 


59 


0.24 







TOTAL 4285 RESIDUES EXCLUDING 
N -TERMINAL METHIONINE RESIDUES 



FIG.8 
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